add nemo_bridge by chai-xiaonan · Pull Request #1050 · flagos-ai/FlagScale

chai-xiaonan · 2026-01-09T02:51:34Z

Rebuild the Nemo-Bridge integration on top of the restructured FlagScale version. FlagScale now supports part of nemo-bridge's functionality, so the framework can load and save checkpoints in HF format during training. This version also adds a save_hf_interval setting, which controls how many iterations pass between saves of HF weights. Accuracy has been verified as correct for Deepseek V3 16_a3B, Qwen3-32B, and Qwen3-0.6B.
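As a rough illustration of how an interval setting such as save_hf_interval could gate HF-format saves, the sketch below fires every N iterations. The helper name `should_save_hf` and the choice to treat a missing or non-positive interval as "disabled" are assumptions made for this example, not code taken from the PR.

```python
from typing import Optional


def should_save_hf(iteration: int, save_hf_interval: Optional[int]) -> bool:
    """Return True when an HF-format checkpoint is due at this iteration.

    Hypothetical helper: a missing or non-positive interval disables HF saves.
    """
    if not save_hf_interval or save_hf_interval <= 0:
        return False
    # Skip iteration 0 so nothing is written before a training step has run.
    return iteration > 0 and iteration % save_hf_interval == 0


if __name__ == "__main__":
    due = [it for it in range(1, 11) if should_save_hf(it, 4)]
    print(due)  # [4, 8]
```

A training loop would call such a check once per iteration, next to the existing check for regular checkpoint saves.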

lxd-cumt · 2026-01-09T03:06:30Z

flagscale/train/megatron/training/checkpointing.py

+            #Load the HF model from config
+            config_load = args.hf_config_path
+            config = safe_load_config_with_retry(config_load, trust_remote_code=False)
+            bridge = AutoBridge.from_hf_config(config)


Will this save-ckpt step allocate extra GPU memory when initializing an HF model?

lxd-cumt · 2026-01-09T03:09:20Z

flagscale/train/megatron/training/checkpointing.py

+        bridge.load_hf_weights(ddp_model)
+        # no optimizer weight
+        iteration=0
+        num_floating_point_operations_so_far=0


Please add a print_rank_0 call here.

lxd-cumt · 2026-01-09T03:11:35Z

flagscale/train/megatron/training/checkpointing.py

+        # use megatron bridge
+        from megatron.nemo_bridge.models import AutoBridge
+        bridge=AutoBridge.from_hf_pretrained(load_dir)
+        bridge.load_hf_weights(ddp_model)


Can nemo-bridge's load_hf_weights handle a ddp_model directly when ddp_model is wrapped by DistributedDataParallel?
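If the bridge turns out to need the bare model, one common pattern is to peel wrapper layers before handing the model over. The sketch below shows only that pattern: `FakeDDP` and `FakeModel` are stand-ins invented for this example, and whether the bridge already unwraps internally is not confirmed here.

```python
def unwrap_model(model, wrapper_types):
    """Peel wrapper layers that expose the inner model as `.module`."""
    while isinstance(model, wrapper_types):
        model = model.module
    return model


class FakeDDP:
    """Stand-in for a DistributedDataParallel-style wrapper (illustration only)."""

    def __init__(self, module):
        self.module = module


class FakeModel:
    """Stand-in for the underlying model that holds the weights."""


if __name__ == "__main__":
    inner = FakeModel()
    wrapped = FakeDDP(FakeDDP(inner))  # wrappers can be nested
    print(unwrap_model(wrapped, (FakeDDP,)) is inner)  # True
```

In real code the `wrapper_types` tuple would list the actual wrapper classes in use, and the unwrapped model would be passed to the weight-loading call.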

lxd-cumt · 2026-01-09T03:20:46Z

flagscale/train/megatron/nemo_bridge/__init__.py

@@ -0,0 +1,8 @@
+# Copyright (c) 2025, BAAI. All rights reserved.


NeMo Megatron-Bridge can be installed with pip; see https://pypi.org/project/megatron-bridge/
Please remove the copied source code.
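Once the package is a pip dependency rather than copied source, the training code may want to fail early with a clear message when it is missing. A minimal sketch using only the standard library follows; the helper name `bridge_available` is hypothetical, and the default module name `megatron.bridge` is taken from the import pattern mentioned in this review.

```python
import importlib.util


def bridge_available(module_name: str = "megatron.bridge") -> bool:
    """Report whether `module_name` can be found without fully importing it."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (e.g. `megatron`) is not installed.
        return False


if __name__ == "__main__":
    if not bridge_available():
        print("megatron-bridge not found; try: pip install megatron-bridge")
```

Note that `find_spec` imports parent packages while resolving a dotted name, so the check is cheap but not entirely side-effect free.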

lxd-cumt · 2026-01-09T03:35:34Z

flagscale/train/megatron/nemo_bridge/__init__.py

@@ -0,0 +1,8 @@
+# Copyright (c) 2025, BAAI. All rights reserved.


Rename flagscale/train/megatron/nemo_bridge to flagscale/train/megatron/bridge so that it matches the import pattern from megatron.bridge

tengqm

When copying source code from other repos, we are obliged to copy their copyright notice as well. We cannot claim copyright for this code.
The original code has the following copyright header, which must be preserved:

# Copyright (c) 2025, NVIDIA CORPORATION.  All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
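A check like the one below could flag adapted files that have lost the upstream notice. It is a sketch only: the function name, the 20-line window, and matching on the first sentence of the notice are assumptions for illustration, and it does not replace a manual license review.

```python
UPSTREAM_NOTICE = "Copyright (c) 2025, NVIDIA CORPORATION."


def has_upstream_notice(source_text: str, max_lines: int = 20) -> bool:
    """Return True if the upstream copyright line appears near the top of a file."""
    head = source_text.splitlines()[:max_lines]
    return any(UPSTREAM_NOTICE in line for line in head)


if __name__ == "__main__":
    kept = (
        "# Copyright (c) 2025, NVIDIA CORPORATION.  All rights reserved.\n"
        "# Licensed under the Apache License, Version 2.0\n"
        "import os\n"
    )
    dropped = "# Copyright (c) 2025, BAAI. All rights reserved.\nimport os\n"
    print(has_upstream_notice(kept))     # True
    print(has_upstream_notice(dropped))  # False
```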

tengqm · 2026-01-11T03:14:16Z

flagscale/train/megatron/nemo_bridge/models/qwen/qwen2_bridge.py

@@ -0,0 +1,110 @@
+# Copyright (c) 2025, BAAI. All rights reserved.
+#
+# Copied from: https://github.com/NVIDIA-NeMo/Megatron-Bridge


If Megatron-Bridge has a copyright claim, we are supposed to paste their copyright statements here.

tengqm · 2026-01-16T02:49:35Z

flagscale/train/megatron/nemo_bridge/models/conversion/auto_bridge.py

+
+                if not has_implementation:
+                    raise ValueError(
+                        f"\n�~\~W Model architecture '{architecture}' is not yet supported\n\n"


What are these weird characters?
There are some other similar cases in this string.
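Characters like these are often the Unicode replacement character or stray control bytes left behind by an encoding mismatch during copy and paste. The sketch below scans text for both; the function name and the choice of what counts as "suspicious" are assumptions for this example, and legitimate non-ASCII text such as emoji is deliberately not flagged.

```python
import unicodedata


def find_suspicious_chars(text: str):
    """Return (line, column, codepoint) for replacement or control characters."""
    hits = []
    for line_no, line in enumerate(text.split("\n"), start=1):
        line = line.rstrip("\r")  # tolerate CRLF line endings
        for col, ch in enumerate(line, start=1):
            is_replacement = ch == "\ufffd"
            is_control = unicodedata.category(ch) == "Cc" and ch != "\t"
            if is_replacement or is_control:
                hits.append((line_no, col, f"U+{ord(ch):04X}"))
    return hits


if __name__ == "__main__":
    sample = 'ok = 1\nmsg = "\ufffd bad"\n'
    print(find_suspicious_chars(sample))  # [(2, 8, 'U+FFFD')]
```

Running such a scan over the copied files would locate the other similar cases mentioned above.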

tengqm · 2026-01-16T02:51:11Z

flagscale/train/megatron/nemo_bridge/models/conversion/model_bridge.py

@@ -0,0 +1,359 @@
+# Copyright (c) 2025, BAAI. All rights reserved.
+#
+# Mainly adapted from: https://github.com/NVIDIA-NeMo/Megatron-Bridge


Please clarify what has been "borrowed".
Please also paste the original copyright claim here if the code was not originally written by us.

tengqm · 2026-01-21T05:39:44Z

flagscale/train/megatron/nemo_bridge/models/conversion/auto_bridge.py

@@ -0,0 +1,202 @@
+# Copyright (c) 2025, BAAI. All rights reserved.


It looks to me as though this file was largely adapted from flagscale/train/megatron/nemo_bridge/models/conversion/auto_bridge.py. We copy-pasted the source and are now claiming copyright for this code. This is not acceptable.

We can borrow code from other projects, provided that the license terms grant us this right. Even then, we still have to give credit to the original authors. We are obliged to mention their copyrights.

There are some weird characters in this file, which are obviously the result of a character conversion problem during copy/paste. Please fix them as well.

chai-xiaonan · 2026-02-05T08:48:30Z

Received, thanks.

add nemo_bridge

d82146d

chai-xiaonan requested review from aoyulong, heavyrain-lzy and zhaoyinglia as code owners January 9, 2026 02:51

lxd-cumt reviewed Jan 9, 2026

tengqm reviewed Jan 11, 2026

chai-xiaonan and others added 7 commits January 15, 2026 16:34

Merge branch 'main' into add_nemo_bridge

a358436

Reconstruct the code, and some functions use pip megatron-bridge

90c2ed0

Merge branch 'main' into add_nemo_bridge

224f8e1

Merge branch 'flagos-ai:main' into add_nemo_bridge

c0b0f37

delete readme and swp file

2900956

Merge branch 'add_nemo_bridge' of https://github.com/chai-xiaonan/Fla…

0dd3b6b

…gScale into add_nemo_bridge

Merge branch 'main' into add_nemo_bridge

961b6c5

tengqm reviewed Feb 5, 2026

chai-xiaonan wants to merge 8 commits into flagos-ai:main from chai-xiaonan:add_nemo_bridge


3 participants
